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I. The dichotomy of small errors. 



There are two kinds of small errors which do not follow the same differential calculus. 

In order to understand this phenomenon, let us consider two researchers, two applied 
mathematicians say, who endeavor to perform random simulation rigorously. Both are 
expert in using inversion or rejection methods so that they are able to simulate any 
probability law as soon as they can draw a real number randomly in the interval [0, 1]. 

For this, congruence methods are of course available, statistically excellent, but instead 
they want to be able to assess the committed error : 

- the first one draws the binary digits by heads or tails. 

- the second one uses Polya's urn. 

Let us look at the biases and the variances of the errors after n drawings. 
In the first case X — U, CL1CL2CL3 ' ' ' and we approach 

00 n 
k=l k=l 



The tti are independent random variables with value 1 or with probability |, 
J-'n = cr(ai, . . . , an). Let us put 

bn = E[{x - Xn)\J^n] 
dn =^[{x-Xnf\Tn] 



We have 



6„ = E 



7; = f] -p 



1/2 _ 1 ^ _ 1 1 _ 1 1 



n+1 

In the second case , let us recall the rule of Polya's urn. At the beginning there are 
one white and one black ball in the urn and each time a ball is drawn out, it is put back 
into the urn together with another one of the same color. 

After n drawings, the ratio of white balls in the urn may be written 



Xn+iin + 3) = X,(n + 2) + l{f/„^,<x„} 
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where Un+i is uniformly distributed on [0, 1] independent of J^n = (^{Xq, . . . , 
In other words ^ 

Xn+l = Xn+ ^j-^(l{[/„+i<X„} - Xn). 

We see that X^, is a bounded martingale which converges a.s. and in L^, j9 G [1, oof, to 
Xoo and it is not difficult to prove that when the initial composition of the urn is one 
white and one black ball, X^o is uniformly distributed on [0, 1]. 
We have for the bias and the variance 



bn = E[X^-Xn\Tn]=0 
Vn = dn = E[{Xoo - Xnfl^n] with E[Vn] = + o{l / Tl) . 

on 

We see that in the first case the variances are smaller than the biases, and in the 
second case the biases are smaller than the variances. 



How will this propagate trough the computations of our two modelers ? 
Let us write the Taylor expansion of a C^-function with bounded derivatives. 

fiX)-fiXn) = iX-Xr^)fiXr^) + ^iX-XnfnXn)+^iX-XnfriXn + eiX-Xr,)) 

Bn = E[/(Xoo) - f{X^)\Tn] = bJ'{X^) + IdnfiX^) + 
Dn= E[(/(Xo,)-/(X„))2|^,] = dJ'\X^) + 0{dn) 

Three cases appear (supposing that the derivatives /' and /" do not vanish for simplicity, 
or that the sets of their zeros are negligible for the law of X„) : 

17 When the variance is neghgible with respect to the bias, Vn <^ bn, (first researcher), 
we have also dn <C bn, asymptotically the dominating term for the bias Bn is the first 
term bnf'{Xn). We see that Dn is negligible before Bn so that I4 ^ Bn, the situation 
will endure. The only useful formula is 

E[/(Xoo) - fiXn)\Tn] = bnf\Xn) + o{bn) . 

2°/ When the variance is of the same order of magnitude as the bias, this situation 
will persist {Dn and Vn are of the same order of magnitude as Bn) . 

37 When the bias is negligible before the variance bn <^ Vn (second researcher), the 
dominating term of the bias becomes ^dnf"{Xn) which is equivalent to ^Vnf"{Xn) and 
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we fall back into the second case (2°/) where the bias Bn and the variance Vn are of the 
same order of magnitude. 

In the cases 2°/ and 37, 6^ may be neglected before Vn and the useful formulae are 

nf{Xoo) - f{Xn)\J'n] = bnf'iXr,) + lvJ"{Xn) + 0^ 
nifiXoo) - /(^n))'|^n] = Vnf\Xn) + 0^)- 

We see that our first modeler may content himself with an error calculus based on the 
first derivatives. 

Instead, the second modeler, who uses Polya's urn, needs an error calculus for the 
bias and the variances. 

- the calculus for the variances is a first order calculus, 

- the one for the biases is a second order calculus which uses the calculus on variances. 

In practice, generally, we do not control the nature of the errors. In a modelization, 
errors on data are exogenous, we do not know exactly from where they come. Therefore 
it is wise to do as if we were in the second case, especially to take in account the stochastic 
nature of the errors and the non-linearity of the model. 

This may be said otherwise : to consider that a sensitivity analysis on the data or 
on the parameters may be done with the first derivatives, is to suppose that the errors 
on these quantities are representable by their mean. If, for more and more accurate 
measurements, the errors are thought as a stochastic process, this amount to consider it 
satisfies the ordinary differential calculus instead of the Ito calculus. We shall come back 
to this analogy later on. 

II. Intrinsic error calculi. 

In the error calculus on biases and variances, the calculus on variances do not involve 
the biases, it is natural to begin with it, and this takes us back to Gauss at the beginning 
of the XlX-th century. 

Gauss' error calculus on variances and covariances . 

Twelve years after his demonstration that, under some hypotheses, error follow the 
normal law. Gauss is interested in the propagation of errors {Theoria combinationis 
1821). Given a quantity U = F{Xi, X2, . . .) function of other quantities Xi, X2, . . ., he 
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consider the problem of computing the quadratic error on U knowing the quadratic errors 
(7i, (7^, . . . on Xi, X2, . . ., assuming these errors are small and independent. 
His answer is the following : 

and for another function V = G{Xi, X2, . . .) he gives the covariance of the errors on U 
and V: 

OF dG 2 OF dO 

Gauss didn't study the propagation of biases, which, as we shall see, is more delicate. 
The erring ways of textbooks : "ugly" formulae. 

In spite of the works of Gauss, during the whole XlX-th century and also in the 
XX-th century, have been taught in mathematical or physical textbooks very ambiguous 
formulae like 

dF OF 
(3) Ac;=|_|AX, + |_|AX. + ... 

as, for example, in the course Mathematiques generales (1947) of Vessiot and Montel, 
where the AXi, AX2, AU, are the absolute value of the estimated error. Sometimes (3) 
is justified (J. Taylor An Introduction to Error Analysis, University Science Books 1992) 
by the inequality 



y^ox/^' + • • • + ^ox/^^ ^ fel"^ + • • • + \oxJ^'- 

which would make Gauss' formulae (1) and (2) useless. In addition to the fact that in 
(3) it is not at all clear what are AXi, AX2, AU, these formulae are clumsy. 

It is not a question of taste or aesthetics, it is an aflFair of symbolism and concepts : 
with (3), \AU\ generally depends on the way the function F is written. By composing 
two linear maps with values in M? we see that the identity map increases the errors ... 
impossible to work properly in such conditions. 

This does not happen in the calculus of Gauss. To see this, we may introduce the 
differential operator 

r 1 2 1 2 

^=r'dx!^r'dxi^"' 
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and remark that (1) writes 

gI = LF'^ - 2FLF. 

Then the coherence is related to the transport of a differential operator by a function. If u 
and V are regular injective mappings and if we denote 6uL the operator i— ^ L{ipou)ou^^, 
we have dyouL = 9yi9uL). We shall make this more precise later on. 



Geometrization. 



Errors on Xi,X2,... may be supposed non-independent and may depends on the 
values of Xi,X2, . . . : we consider a field of positive symmetric matrices aij{xi,X2, ■ ■ ) 
on representing the conditional variances and covariances of the errors given the values 
Xi, X2, . . . of Xi, X2, . . ., and the error calculus becomes 

(5) al = ^ ^{xuX2, . . ■ ■ -^^^J*^^!' • • •) 

ij * ^ 

In order to geometrize the error calculus, i.e. to find a language for the biases and 
the variances which depends on the only mathematical objects and not on their written 
form, we proceed in two steps. 

First we argue on W^, setting about it in such a way that if 

F = fog = koh 

the error depends only on F. Then we shall extend this when F takes its values in a 
manifold. 

a) Let F be a regular function, say, of two variables x and y say. Let us consider 
the increment of F between {x,y) and {x + dx,y + dy). Here dx and dy are arbitrary 
increments. 

Taylor formula gives 

AF = F{x+dx, y+dy)-F{x, y) = Pi{dx, dy)+- • -\-^Pnidx, dy)+{\dx\+\dy\)''o{\dx\+\dy\) 

n! 

where 

/c' d^F 

Pu(dx,dv)= ^ — dxMv^ 

p+q=k ^ ^ ^ 
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is a homogeneous polynomial of degree k in dx, dy with coefficients depending on x,y 
which we shall call the differential of order k of F denoted d'^F. 
From the formula 

AF = dF-\- ]-d^F + • • • + —d'^F + {\dx\ + \dy\Yo{\dx\ + \dy\) 

we deduce the differential calculus of interest for us : if x = /(li, w) and y = g{u, w) 
and if Z{u^v,w) = F{f{u,v,w),g{u,v,w)) we have 

dZ = Pi{du, dv, dw) = ^dx + ^dy 

d^Z = P2{du, dv, dw) = dPF (quadratic terms in dx^ dy) 

+%<P^ + %<Py 



(6) 



(linear terms in d X^ d y, quadratic in du, dv, dw) 



what may be remembered by writing 

d^Z = d{dF) =d{%dx^^dy) 



dFi2^ 



dFi2„ 



= d{%)dx + %d^x + d{%)dy + %d\ 

b) Randomization . Formulae (6) are identically valid when the increments du^ dv, dw 
are supposed to be random. 

Simply the Landau symbols o(.) are now random and we must be carefull to the 
integrability when expectations are taken. 

Thus, we may write with capitals as is usual in probability theory : \i Z = F(X, Y) 



dZ= ^dX + ^dY 

ox ay 



OF, 



(7) 



'dydy 



5y2 



dy 



If the functions /, g and F have bounded derivatives and if the random variables dU, dV, dW 
are square integrable, we see that dX, dY and dZ are square integrable and d'^X, d^Y 
and d'^Z are in V-. 
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We introduce now the bias and variance operators A and F associated to the erroneous 
random variable {X, Y) : 

(A[F]{x,y)= ¥.[dZ+\d?Z\X = x,Y = y\ 

(8) { 

{ T[F]{x,y)= ¥.[{dZf\X = x,Y = y] 

Let us point out at once that in the formula giving A[F] we may or not put dZ, we shall 
come back on this, we let it for the moment. 

- Let us remark that if we apply A to the function F^, we obtain d'^{F'^) = d{2FdF) = 
2{dFf + 2Fd^F, so that 

= W^y[2FdF + {dFf + Fd^F] = 2FA[F] + V[F] 
V[F] = A[F'^] - 2F A[F]. 

- If we consider 



H[x, Y) = y), F2(X, y), . . . , Y)) 

we see by the change of variable formulae (7) that 



(10) 



i=l i,j=i- 



i,j=i ■> ■> 



the second relation generalizes that of Gauss to the case of possibly correlated errors. 

c) The analogy of formulae (10) with Ito's formula for semi-martingales and their brack- 
ets is striking. It may be made more precise as follows : 
Let us consider the vector field 

A[X]{x,y) = E[dX + ^d^X\X = x.,Y = y] 
A[Y]{x,y) = E[dY + ld'^Y\X = x,Y = y] 

and the field of matrices 



P/ nX]{x,y) nx,Y]{x,y) 

L[x,y) ^^x,Y]ix,y) mi^.v) 



and let Yl{x,y) be a regular square root of F, let us consider the diffusion process S 
solution to the stochastic differential equation 

r dSl= Y.n{St)dBl + Y.i2{St)dB'i + A[X]{St)dt 
\ dSf = ^2i{St)dBj + J^22{St)dB^ + A[Y]{St)dt 

where B = {B^, B^) is a standard Brownian motion. Ito's calculus gives 

+i[0(s,)r[x](s,) + 2^(s,)T[x,Y]{St) + 0(s,)r[y](s,)]rft 

and also, denoting (.)* the continuous finite variation part 

' idF,{S)tr = ^{S,)A[X]{St)dt + ^{St)A[Y]{Sf)dt 

+|[0(^Or[x](5,) + 2^(5,)r[x,y](5,) + giis,)m{s,)]dt 

(11) \ and 

so that the change of variable formulae for the biases and the variances of errors (10) are 
fulfilled if we were defining AfF^Ka;, y) and T[Fi, Fj]{x, y) by the relations 

the same for H and any regular function of X and Y. 
To sum up 

We see that the notion of erroneous random variable (X, Y) with values in may 
represented by 

a random differential operator of order 1 



dF dF 
and a random differential operator of order 2 



F ^ bfx,Y)lF] = —{X,Y)dX + —(X,Y)dY 



F - afx.Y)[F] = ^d'X + ^d'Y + ^{dXf + l^dXdY + '"^^{dYf 
^ ' OX oy ox^ oyoy oy' 
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where dX^ dY^ d'^X, d^Y are a priori any random variables. 

Then, for an error calculus retaining only the two first moments of the conditional 
law of the error given (X, Y) , we sum up by two (deterministic) differential operators 
the bias operator 

A[F]{x,y) = nh^xx) + l^xxmU^Y) = 

and the operator of variance 

T[F]{x,y)=n{hx,Ym?U,Y) = ix,y)]. 
and we have the change of variable formulae 

(12) { 

a) Let us remark that we have identically 

ihx,Y)[F]? = lo^ix,Y)[F'] - Fa^x,Y)[F] 

therefore 

(13) r[F] =A[F^] -2FA[F] 

the operator F may be deduced from the operator 

b) If instead of we had taken 

where c is a random field of first order differential operators, and so 

A[F] = E[b^x,Y)[F] + la^x,Y)[F] I iX,Y) = ix,y)] 
formulae (12) and (13) would be still verified with A, T remaining unchanged. 

Remarks a) and b) above show that, for propagating the errors, if we keep the only 
operators A and F and formulae (12), then operator A discloses some ambiguity for its 
first order terms. We might forget the first order operator b. This may be understood 
by the very nature of the notion of bias which needs an origin for reference. This 
reference changes if we add at the beginning a deterministic error, of parallax type, 
which propagates following a first order operator. 
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d) In manifolds . 

Let us recall that given a manifold M, a second order tangent vector at a is a differ- 
ential operator at point a, without constant term, of order < 2 , we denote Ta{M) their 
set and t(M ) the space of fields of second order vectors. 

A second order differential form is a C°°-function on r(M) linear on each Ta{M). We 
denote r*(M) the space of second order differential forms. 

Example : If / and g are real functions on M we may define the second order form 
d^f = A G r(M) 1—^ A(/) and then the second order form 

df-dg=^id\fg)-fd'g-gd'f). 

We say that A G t(M) is of elliptic type if <X,df ■ df> >0 V/. We denote r%M) the 
space of fields of elliptic type second order vectors. 

We consider a random variable X with values in M and, at each point x G M, a 
probability measure on r^{M) regular with respect to x. 

In other words, we consider a random field of second order tangent vectors of elliptic 
type (only its marginal laws of order one will be used afterwards). We denote this field 
AX : it is the "error" on X. 

Then we define 

r A[f]{x)= lE[<d'f,AX> \X = x] 
(14) { 

[ r[/](x)= n<df-df,Ax> \x = x] 

A is a field of second order vectors and F is a field of bilinear differential operators of 
positive type given by 

m = A[f]-2fA[f]. 

If we consider h = (^(/i, /2, . . . , //c), denoting a coordinate system, we have 

/2, . . . , fk)] (x) = [< AX, d'h >] 

= iE^[< AX, J2i Dih d^x' + Dijh dx' ■ dx^ >] 

n^ifu /2, • • • , fk)][x) = E,[< AX, dh ■ dh>] 

= E^[< AX, Dih Djh dx ■ dx^ >] 

with 
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thus, getting rid : 
(15) { 

[ r[^((/i,...,/,)]= Ep,,^>;r[/„/j. 

The interpretation in terms of diffusion processes is similar to that of the flat case. We 
know that if the Ito differential is dF{S)t, its flnite variation part {dF{S)ty is intrinsic 
(cf. Paul- Andre Meyer, "Geometric stochastique sans larmes" in Sem de Probabilites 
XV, Lectures Notes 850, Springer 1981, p 51), with the interpretation given in the flat 
case, F i— ^ A[F] taken in St is the second order tangent vector of the local characteristics 
of 5. 

Let us remark that if we add to the error AX a random flrst order tangent vector, 
changing AX in AX + 6, formula (14) becomes 

MfK^) = l^A<df.b> + <d'f,AX>] 

because c?^/|tm = df (cf. P. -A. Meyer op. cit., p 49) what does not change F and Ai, 
still satisfles formulae (15). 

Hence we see that the interpretation in terms of errors agrees with the fact that on a 
manifold, the flrst order part of a second order tangent vector is not deflned [except if a 
linear connection is available to share the deterministic and the stochastic part (contin- 
uous local martingale) of the error]. Already in the flat case, the flrst order part of the 
bias needs a convention at the start. 

Remark. Why did we limit ourselves retaining from the conditional law of the error only 
the two flrst moments, since the formalism of n-th order differential forms would make 
it possible to deal with the propagation of the other moments ? First let us say for the 
sake of simplicity, but a more precise answer will be given later on ( Remark in part IV 
page 23). 
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III. Complete and symmetric error calculus. 



a) The symmetry as invariant. 

If we start from a situation where the error is centered, by a non-Unear mapping the 
error is no more centered. 

What is preserved by image ? 

If Xn is an approximation of X, and if the joint law of the pair (X, X„) is symmetric, 
then the law of ('0(X), is of course symmetric too. Concretely this represents 
situations where we ignore between two close random variables Xq and Xi which is the 
right one which is the erroneous one, and by this hesitation we work with (X^, Xi^b) 
where 5 is a Bernoulli independent variable. 

Let us take the notation of the flat case. If (X, X + AX) is a symmetric pair, we have 

F{X + AX) - F{X) =dF + \(fF + remainder 
G(X + AX) - G(X) = + \d[^G + remainder 

so 

E[F(X+AX)G(X)-F(X)G(X+AX)] 

= E[G(X)(dF + \(fF) - F{X){dF + \(fG) + remainder ] 
= E[G(X)^[F](X)] - E[F(X)^[G](X)] + remainder 
=< G, AF >jy - < F, AG + remainder 

denoting u the law of X. 

We see that as soon as the pair (X, X + AX) is symmetric and the errors small, 
the operator A is symmetric with respect to the law of X. This property of symmetry 
(which solves the ambiguity of the first order terms of A) is an invariant by image : 
the bias operator of the error of the image of X by an application ip is symmetric with 
respect to the law of (^(X). 

The symmetric case allows to construct a much more powerful framework thanks to the 
theory of Dirichlet forms, and this framework extends easily to the infinite dimension (cf. 
N. Bouleau, Error calculus for finance and physics, De Gruyter 2003). We sketch it now : 
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b) Error structures 

An error structure is a term 

(^7,^,p,D,r) 

where {U, A, P) is a probability space, satisfying the following properties : 

1. ) D is a dense sub-vectorspace of L^(n,^, P), 

2. ) r is a symmetric bilinear map from D x D into L^(P) satisfying the Gauss calculus 
of class n Lip, 

i.e. if u G O"^ and v G O'^, and if F and G are of class and Lipschitz, from R"^ [resp. 
W] into R, then F o u eB and G o v eB and 

r[Fou,Gov] = J2Fliu)G'j{v)r[ui,Vj] P-p.s., 

3. ) the bihnear form S[f,g] = EFf/, 5^] is closed, 

i.e. P is complete for the norm || . ||d = (|| • Ili2(p) + ^[ • , • ])^, 

4. ) 1 G © (hence r[l, 1] = 0, markovianity) . 

We write £[f] for £[f, f] and r[/] for r[/, /]. 

With this definition, the form £^ is a Dirichlet form (local and admitting a square field 
operator) . 

To this form corresponds a Dirichlet operator A (generator of the semi-group associ- 
ated to E) which, under suitable hypotheses on F, satisfies : 

A[F o w] = ^ i^' o w A[ui\ + ^ ^ w nui, Uj] P-p.s.. 

i i,3 
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Example 1. ( Ornstein-Uhlenbeck structure in dimension 1) 

n = R 

A = Borel d-field ^(M) 
P = jV(0, 1) reduced normal law 
D = H^{J\f{0, 1)) = {ue L'^{F),u' in distributions sense 

belongs to L'^(¥)} 

r[u\ = u'^ 

then, (E,^(E),7\/'(0, l),if^(A/'(0, l)),r) is an error structure. We can also obtain the 
bias operator (the associated generator) : 

VA = {f e L2(P): /" - xf in the sense of distributions G L2(P)} 

and Af = ^f" — • f where / is the identity map on R. 

Example 2. ( Monte-Carlo structure in dimension 1) 

n = [0, 1] 

A = Borel (j-field 
P = Lebesgue measure 

D = {w G ^^([0, 1], dx): the derivative u' in the sense of distributions 

on ]0, 1[ belongs to L^([0, 1], dx)} 
r[u\ = u'\ 

the space B thus defined is denoted //^([0, 1]). 

Example 3. Let U he a domain (connected open set) of R'^ with unit volume, B{U) the 
Borel (7-field and dx = dxi, . . . dxd the Lebesgue measure, 

D = {u G L'^{U, dx): the gradient Vu in the sense of distributions 
belongs to L^{U, dx]R'^')} 
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Then {U, B{U), dx,I]),r) is an error structure. From the relation S[f,g] = {—Af,g) it 
foUows that the domain of the generator contains the functions of class with compact 
support in U, VA D C]^{U) and that for such functions 



2 ^ dx] ' 

i—l * 

Example 4. 

Let D be an open set in with unit volume. Let ¥ = dx he the Lebesgue measure 
on D. Let F be defined on C^{D) by 

where the functions a^j satisfy the following hypotheses 
•a^jELl^iD) ^gLL(/^) ^,j,/c = l,...,(i 

• ^ a^j{x)^^^j > V<^ e Vx e L> 

• aij{x) = aji{x) yx E D. 

Then the pre-structure (L>, ^(D), P, C^(L'), F) is closable. 

Comment on the completeness. 

The fact that we ask the form S be closed is a restriction, but this restriction is 
highly fruitful. The situation is quite analogous to the question of the cr-additivity in 
probability theory : without this property, nothing can be said on the transmission of 
errors by objects defined by limits. But many objects in contemporary mathematics 
are defined by limits (integrals, solutions to ode, solutions to sdp, stochastic integrals, 
solutions to sde, etc.p 



^Thc philosopher Karl Popper is fallen in this trap by emphasizing that his own theory of probability (additive) do 
contain strictly the one of Kolmogorov (cr-additive) Cf. N.Bouleau "Some thoughts upon axiomatized languages, a focus on 
probability theory and error calculus with Dirichlet forms" ButUeti de la Societat Catalana de Matematiques Vol. 18 n2 



p25-36, (2004) cf. http : //hal . archives-ouvertes . f r/hal-00105636 
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This extension tool allows 

- to extend the error calculus on variances from C^-functions to Lipschitz functions, 

- to prove a criterion of existence of density for the probability laws which generalizes 
Malliavin's method : the image energy density property. 

The closedness property is preserved by images and products even infinite products. 
This permits to construct naturally errors structures on the spaces of stochastic processes. 

In particular on the Wiener space, where may be obtained, among others, the Ornstein- 
Uhlenbeck operator. As well, the Malliavin calculus may be interpreted as an error cal- 
culus. 

Theorem on products 

Let Sn = {^n, Anj^nj^n^n) , u > 1, be error structures. The product structure 

oo 

S = {n,A,F,B,T) = l[Sn 

n=l 

is defined by 

/ oo oo \ 

\n=l n=l J 

D = |/ £ I/^(P): Vn, for almost every wi,W2, . . . , Wn-i, . . . 

for the product measure 
X f{wi, . . . , Wn-i, X, Wn+i, and 

SY.n^n[f]dF<+^] 

and for / £ B 

oo 

r[/] = Er»[/l- 

n=l 

S is an error structure, Markovian if each Sn is. 

When we write r„[/], acts only on the n-th argument of /. 



17 



Starting from the basic brics that are one dimensional errors structures, by product 
we obtain error structures on function spaces. 
This yields easily error structures on 

- the Wiener space, cf. N. B. h F. Hirsch, Dirichlet Forms and Analysis on 
Wiener Space, De Gruyter 1991, 

- the general Poisson space, the Monte Carlo space, cf. N. B. Error Calculus for 
Finance and Physics, De Gruyter, 2003. 

Images of error structures. 

The operation is as mere and almost as general as the image of a probability measure 
by a measurable map. 

If {Q,, A, P, B, r) is an error structure and X a random variable with values in M.^ whose 
components are in D, the term {W^, B{M.^),Fx, ©x, fx) is an error structure, where 

¥x is the law of X, 
Bx = {f e L^Fx) : foX eB} 
Tximx) = E{r[foX]\X = x}, /GO. 

In fact it is possible to define images by more general random variables. 

The image structure by X may be called the "Dirichlet-law" of X. It is an error 
structure on M.^ such that the identity map / has its components in the domain of Fx 
and the following formulae hold : 

E[r[X]|X] =Tx[I]oX 

E[r[if{x)]\x] =rx[ip]oX. 

Several theorems of probability theory possess analogs in the theory of errors struc- 
tures under some conditions : Gateaux-Levy, Strassen, etc. (cf. Bouleau-Hirsch [de 
Gruyter 1991].) 
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The case of Wiener space. 

• Let (Xn)neN orthonormal basis of L^(R+, dx), and let (pn)neN ^ sequence 
of i.i.d. reduced Gaussian random variables. 

To a function / G dx) is associated the Wiener integral 

n 

homomorphism of L^(M+, c^x) into L'^{Q, A, P). If we put 

B{t) = V(l[o,t], Xn)Pn = f ^"(^) • 

then is a standard Brownian motion. 

Because of the case where / is a step-function, /(/) is denoted /(/) = /(s) dBg 

• The preceding construction uses the product space 

{n,A,F) = (M,^(R),A/'(0,l)f , 
the QnS being the coordinate maps. If we put an error structure on each factor 

(M,^(M),A/'(0,l),d„,7,), 
we obtain an error structure on {Q,, A, P): 

00 

[n, A, P, D, r) = ^(R), A/'(0, 1), d„ 7n) 

n=0 

such that a random variable F{gQ, gi, . . . , Qn, . . .) belongs to D iff 
r Vn x^ F{go, . . . , g^-i, x, gn, ■ • •) belongs to d„ P-p.s. 

I and r[F] = X^„7n[^] belongs to L^(P) (7^ acting on the n-th variable of F). 
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• If on each factor we take the Ornstein-Uhlenbeck structure in dimension 1, we obtain 

rbn] = 1 

r[pm, 9n] = if m ^ n. 
For / e from /(s) dB^ = Xn)gn we deduce 



n 



r 



Jo 



2 



hence, using the functional calculus VF G n Lip(R"^) 



r 



f(| Ms)dB,,...,J Us)dB)j^=Y,^^Jmf,{s)ds 



It is the Ornstein-Uhlenbeck structure on the Wiener space. 



• For the sake of simplicity let us restrict the time to t G [0, 1] and let us take for Xn the 
trigonometric basis. If on each factor we take the Ornstein-Uhlenbeck structure scaled 
by a constant coefficient depending on n : 

oo 

J](R, i3(R),7V(0, 1), H\M{Q, 1)), w ^ (27rn) V) 

n=0 

we obtain on the Wiener space an error structure satisfying on the first chaos : 



r 



f{s)dB, 



= [\f^'^^ns)ds 

Jo 



where /*-^^ is the q-th derivative of /. It is a structure where the error disturbs longitudi- 
nally the Brownian path, which belongs to the family of structures called of "generalized 
Mehler type". 
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IV. The four bias operators. 



We tackle now the following question : How an error, in the usual sense in mathematics, 
i.e. an approximation error, generates an error structure. 

Let us consider a random variable Y defined on the probability space {Q,, A, P) with 
values in the measurable space {E, T) and approximations 1^, n G N, also defined on 
(fi. A, P) with values in (E, JT). 

We suppose there exist an algebra V of bounded functions from E into M or C dense 
in L^(£^,^, Py) containing the constants and a sequence {an)nm of positive numbers, 
about which we consider the following hypotheses : 

mn / V(/? G P, there exists 'A[ip\ e L^iE^J^^Fy) s.t. Vx G P 

the expectation Ey being relative to the law Py. 

/ V(/? G P, there exists A[ip] G L^{E,T,¥y) s.t. Vx G P 



(H2) 



(H3) 



yip G r>, there exists A[lp] G L'^{E, T, Py) s.t. Vx G P 
lim,^oo«nE[(^(i;) - ip{J)\x{yn) - X{y))\ = -2Ey[A[^]x]. 

As soon as two hypotheses among (HI) (H2) (H3) are fulfilled (with the same algebra 
V and the same sequence a„) , the third one follows thanks to the relation 

2 

• When defined the operator A which considers the asymptotic error from the point 
of view of the limit model, will be called the theoretical bias operator. 

• The operator A which consider the error from the point of view of the approximate 
model, will be called the practical bias operator. 

• Because of the property 

< A[(p],X>L^i¥Y) = < ^:^[x] >L2(P^) 

the operator A will be called the symmetric bias operator. 
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The following result shows that a Dirichlet form (possibly non-local) is often present 
behind an approximation : 



Theorem. Under hypothesis (H3) 
a) the limit 



e[^,x] = lim^E[(^(yO - ^iY))ixiYn) - xiY)] 

n z 



defines a closable bilinear form whose smallest closed extension is denoted (^,B). 

b) {S^I}) is a Dirichlet form 

c) B) admits a square of field operator T satisfying \/(p, X ^ ^ 



We introduce now the fourth bias operator )^ defined under (HI) and (H2) on V by 



Since Ey[)^[(/?]x] = lim„ E[((/?(y„) - (/?(y))(x(l^) + x(K))/2] we see that )^ represents the 
asymptotic error from the point of view of an outside observer who attaches the same 
weight both to the theoretical and the practical models and measuring the error alge- 
braically on the same axis. Because of properties we shall state below, the operator )^ 
will be called the singular bias operator. 

An operator B from V into L^(Py) is said first order if it satisfies 



Mmx] = limanEMYn) - ip{Y))\xiYn) + xiY))/2] 




B[ifx]=B[if]x + ^B[x] 
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Proposition. Under (HI) to (H3) 

a) the theoretical variance lim„ anE[{(p{Yn) — (p{Y))^'ijj{Y)] and the practical variance 
lim„ anE[{(p{Yn) — (p{Y)yip{Yn)] exist and we have V(/?, x,"^ 

iim.«,E[((^(i;) - ip{Y))ixiYr,) - xiY)my)] = MS^i^lx + Mi^W -AMx^P] 

lim, anEMY^) - ^(F))(x(^n) - x(^))V^(^n)] = Ek[-^Mx + ^[V^]^X - A[^M 

b ) These two variances coincide if and only if )^ is first order, and then they are equal 
to Ey[r[(/?]V']. 

c) If the Dirichlet form is local, then ^ is first order. 

Remark. Under (H3) condition d) of the theorem (p 22) caracterising the case where the 
form 8 is local is equivalent to either one of the following conditions : 

(j)3A>2 lim„a„E[|^(y,)-v?(y)|^]=0 G P. 

(jj)VA>2 lim„a„E[|^(y,)-^(y)|^]=0 G P. 
This yields an answer to the question put on page 12 about an error calculus with higher 
moments using tangent vectors of order greater than 2 : In the cases where the Dirichlet 
form is local, (the only case in which we are able to propagate errors thanks to a dif- 
ferential calculus) the moments of order greater than 2 are negligible before the first or 
second order moments. 

Let us come back to the case where only hypothesis (H3) is assumed. The following 
result shows that, for the variances, the error calculus performed on the limit model with 
the asymptotic error, coincides with the error asymptotically obtained on C^-functions. 

Proposition. Under (H3). // the form {8, D) is local, then the asymptotic error calculus 
principle is valid on 

P = {F(/i,...,/J : fieV, FeC\W,R)} 

I.e. lim, a,E[(F(/i(y,), • • • , fpiXr,)) - F{f,{Y), . . . , /^(y))^] 

= Er[EL=i ^/(/b • • • ' • • • ' fp)nf^. fj]]- 

Examples 

• Let us take for Y a Brownian motion B indexed by [0, 1] as random variable with 
value in C([0, 1]) and let us take for Y^ the approximation Y^ = B + y^W where W is 
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an independent Brownian motion. We may apply the theorem taking for V the hnear 
combinations of functions ^{B) = e*-^o f'^^ with regular / say C^. 

Hypothesis (H3) is fulfilled. The theorem yields the Ornstein-Uhlenbeck structure on 
the Wiener space. 

• Series with independent increments, let be 



°° V '7 



n=l 

where Xn, £ L'^^^ , centered, Z„) i.i.d., on and S is approximated by the par- 
tial sum Sn = ELi t + f • 

By Burkholder's inequality, we have nE^S* — 5^1^"^^] — ^ as n — ^ oo. So that, taking 
r> = Cf, for if.xeV 

limnE[(^(5) - ifiSn))xiSn)] = limnE[(5 - Sn)^'iSn)xiSn) + ks - Sn)V iSn)x{Sn)] 

n n Z 

= ^E[Z!]E[ip"{S)x{S)]+E[X,]Ey'{S)x{S)]. 
Hence hypothesis (H2) is satisfied and 

Similarly 

limnE[(^(5) - ^{Sn)f] = limnE[(5 - SnW\Sn)] = E[Z!]E[^'\S)] 

n n 

(H3) is fulfilled as soon as the law of 5 satisfies the Hamza condition (cf. M. Fukushima 
et al. [1994]) and then the Dirichlet form is local, and r[ip] = A[ip'^] - 2ipA[ip\ = E[Zl]ip'^. 

• Stochastic intergral. Let us consider the integral 

Y = [ HsdBs 
Jo 



approximated by the sum 



n-l 



Yn = y^Hk{Bk+i - Bk) 



k=0 
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(Bt) is a standard Brownian motion defined as coordinate process on the space C([0, 1]) 
equipped with Wiener measure, Hg = Ho-\- Jq dBu + t]u du is an Ito process regular 
in Mahiavin's sense. Under suitable hypotheses we obtain : 

- Hypothesis (H3) is fullfilled under rather simple regularity assumptions and 

- Hypothesis (HI) supposes finer regularity conditions and 

< AM,x > = \E[J^ i,D,D,[v'{Y)x{Y)]ds] + lE[J„\,,D,,[.f'{Y)x(Y)]ds] 

where D denotes Malliavin's gradient on the Wiener space. Then )^ is a first order op- 
erator. 

• Stochastic differential equations and the Euler scheme. The approximation error of 
the solution of an SDE by the Euler scheme has been the subject of many works whose 
one of the achievements is a form of functional central limit theorem. See especially 
Jean Jacod and Philip Protter "Asymptotic error distributions for the Euler method 
for stochastic differential equations" Ann. Probab. 26, 267-307, (1998) and the quoted 
references. This central limit theorem allows to simplify the study of the limit 

lim anElMYn) - ^iY)f] 

n^oo 

but the existence of the operator A, and therefore the closability of the form are yet 
only proved in dimension 1. On the other hand the operator A has been determined by 
Paul Malliavin and Anton Thalmaier "Numerical error for SDE: asymptotic expansion 
and hyperdistributions" Note C. R. A. S. si, vol 336, nlO, p851, 2003, its appearance is 
similar to the one given above in the case of a stochastic integral. 
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V. Statistics and errors. 



In the error calculus by Dirichlet forms any erroneous quantity is random. The a pri- 
ori probability law may be thought as "the field and the use" of the measurement device. 

How to determine by experiment the operator F of an error structure ? 



Let us suppose that the error structure to be identified be on 

(R'^,i3(M^),P,B,r). 

Concretely {M.^, B{M.^),F) is the image space of a quantity x which is measured with some 
accuracy. P is its a priori law. 

We shall consider that to carry out a measurement of the quantity x, is to estimate x 
statistically as parameter of a family of probability measures Qx- 

It is known that if T is an estimate of x unbiased say (Qa;[T'] = x), the precision on x 
is limited by the inequality 

Qx[iT-x){T-xy]>[Jix)]-' 

with equality if T is efficient (Cramer-Darmois-Fisher-Rao inequality). Let us recall the 
definition of the Fisher information and the Cramer-e^-al. inequality. 

Let be a; G R*^ and let Qx be a family of probability measures on some space dominated 
by the probability measure Q 

Qx = L{x, .) Q with L(x, .) regular in x. 

Then for every random variable Y G L^{Q) 

Ex[Y - Ex{Y)]' > (gradE,(y))V(^)]" VadE,(y) 

where J{x) is the Fisher information matrix of the model 



Let / be the identity map from 'Mr into itself, J{x) behaves as an information, F[/](a;) 
is a precision. 

We assume the identification : 
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r[i]{x) = j-\x) 



If we choose this way r[/](a;) = J ^{x), since F satisfies the functional calculus, if 
/ : — ^ is of class n Lip, we have 

mix) = (grad/)^r[/](x).grad/ 



In other words the operator F is uniquely determined, and similarly for all image struc- 
tures of our error structure [R'^, B{R'^),¥, B, T). 



Hence the question arises to know whether we obtain a precision compatible with this 
calculus when for / one to one we measure f{x). The answer is poisitive. 

Under the hypotheses of a "regular" statistical model, if we consider y = f{x) for 
/ G n Lip and one to one, the error structure obtained for y is the image of the error 
structure obtained for x by the map /. The identification is also stable by products 
in a natural sense. Cf. N. B. and Chr. Chorro, "Structures d'erreur et estimation 
parametrique". Note C.R.A.S., Ser.I 338 (2004), 305-310. 

That means that the error on x obtained this way do not depend on parametrization 
and possesses a physical meaning. 
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